Semi-supervised learning for image classification
Abstract
Object class recognition is an active topic in computer vision that still presents many challenges. Most approaches address this task with supervised learning algorithms that need a large quantity of labels to perform well. This leads either to small datasets (fewer than 10,000 images) that capture only a subset of the real-world class distribution but have a controlled and verified labeling procedure, or to large datasets that are more representative but also contain more label noise. Semi-supervised learning is therefore a promising direction: it requires only a few labels while making use of the vast number of images available today. We address object class recognition with semi-supervised learning. These algorithms depend on the underlying structure given by the data, the image description, and the similarity measure, as well as on the quality of the labels. This insight leads to the main research questions of this thesis: “Is the structure given by labeled and unlabeled data more important than the algorithm itself?”, “Can we improve this neighborhood structure by a better similarity metric or with more representative unlabeled data?”, and “Is there a connection between the quality of labels and the overall performance, and how can we get more representative labels?”. We answer all these questions: we provide an extensive evaluation, propose several graph improvements, and introduce a novel active learning framework to get more representative labels.
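To make the role of the neighborhood structure concrete, the following is a minimal sketch of graph-based label propagation on a k-nearest-neighbor graph. The abstract does not name a specific algorithm, so everything here is an assumption chosen for illustration: the closed-form "local and global consistency" style update, the Gaussian edge weights, the bandwidth heuristic, the toy two-cluster data, and the hypothetical helper names `knn_graph` and `propagate_labels`.

```python
import numpy as np


def knn_graph(X, k):
    """Build a symmetric k-nearest-neighbor affinity matrix with Gaussian weights."""
    sq = np.sum(X ** 2, axis=1)
    # Pairwise squared Euclidean distances, clipped at zero for numerical safety.
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    np.fill_diagonal(d2, np.inf)  # a point is never its own neighbor
    n = X.shape[0]
    idx = np.argsort(d2, axis=1)[:, :k]  # indices of the k nearest neighbors
    # Bandwidth heuristic (an assumption): mean squared distance to the neighbors.
    sigma2 = np.mean(d2[np.arange(n)[:, None], idx])
    W = np.zeros((n, n))
    rows = np.repeat(np.arange(n), k)
    cols = idx.ravel()
    W[rows, cols] = np.exp(-d2[rows, cols] / sigma2)
    # Keep an edge if either endpoint lists the other as a neighbor.
    return np.maximum(W, W.T)


def propagate_labels(W, y, n_classes, alpha=0.99):
    """Spread labels over the graph; y holds class ids, with -1 for unlabeled nodes."""
    n = W.shape[0]
    deg = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(deg, 1e-12))
    # Symmetrically normalized affinity matrix S = D^{-1/2} W D^{-1/2}.
    S = W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    Y = np.zeros((n, n_classes))
    labeled = y >= 0
    Y[labeled, y[labeled]] = 1.0  # one-hot rows for labeled nodes only
    # Closed-form solution of the propagation: F = (I - alpha * S)^{-1} Y.
    F = np.linalg.solve(np.eye(n) - alpha * S, Y)
    return F.argmax(axis=1)


# Toy data: two well-separated clusters with a single label each.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (20, 2)), rng.normal(5.0, 0.3, (20, 2))])
y = np.full(40, -1)
y[0], y[20] = 0, 1
pred = propagate_labels(knn_graph(X, k=10), y, n_classes=2)
```

In this sketch the two labels can only reach other points through graph edges, so the prediction is determined by the graph that `knn_graph` builds. Changing `k`, the distance, or the image descriptor behind `X` changes the result without touching the propagation step, which is the kind of dependence the research questions above refer to.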
Related resources
Semi-Supervised Learning Based Prediction of Musculoskeletal Disorder Risk
This study explores a semi-supervised classification approach that uses random forest as a base classifier to classify the risk of low-back disorders (LBDs) associated with industrial jobs. The semi-supervised classification approach uses unlabeled data together with a small number of labelled data to create a better classifier. The results obtained by the proposed approach are compared with those o...
Fully Polarimetric SAR Image Classification Using Different Learning Approaches
This paper compares multilook Polarimetric SAR (PolSAR) image classification using three types of learning: supervised, unsupervised, and semisupervised. The multilook PolSAR pixel values are complex covariance matrices, and they are described by mixtures of Wishart distributions. Tests on synthetic and real images showed that the supervised and semisupervised classifications provided the ...
Semi-supervised Learning for Multi-label Classification
In this report we consider the semi-supervised learning problem for multi-label image classification, aiming at effectively taking advantage of both labeled and unlabeled training data in the training process. In particular, we implement and analyze various semi-supervised learning approaches including a support vector machine (SVM) method facilitated by principal component analysis (PCA), and ...
Semi-supervised multi-label image classification based on nearest neighbor editing
Semi-supervised multi-label classification has been applied to many real-world applications such as image classification and document classification. In semi-supervised learning, unlabeled samples are added to the training set to enhance classification performance; however, noise is introduced at the same time. In order to reduce this negative effect, the nearest neighbor data edi...
Semi-supervised Marginal Fisher Analysis for Hyperspectral Image Classification
The problem of learning with both labeled and unlabeled examples arises frequently in hyperspectral image (HSI) classification. Marginal Fisher analysis, however, is a supervised method and cannot be directly applied to semi-supervised classification. In this paper, we propose a novel method, called semi-supervised marginal Fisher analysis (SSMFA), to process HSI of natural scenes, which uses ...
Semi-supervised subclass support vector data description for image and video classification
In this paper, a One-Class Classification method, namely the Semi-Supervised Subclass Support Vector Data Description, is presented. The proposed method extends Support Vector Data Description by two means, i.e. by exploiting global class information expressed by the class data variance and local neighborhood information between all available (labeled and unlabeled), following the smoothness a...